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DETAILED ACTION 
Claim Rejections - 35 USC § 102 

1 . The following is a quotation of the appropriate paragraphs of 35 U.S.C. 1 02 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - (b) the invention was patented or described in a printed 
publication in this or a foreign country or in public use or on sale in this country, more than one year prior to 
the date of application for patent in the United States. 

2. Claims 43-49, 51-52, 54-55, 57-58, and 60-61 are rejected under 35 
U.S.C. 102(b) as being anticipated by Dragosh et al. (US Patent No. 6078886). 

3. Regarding claim 43, Dragosh et al. disclose a client-server speech recognition 
system comprising: the client comprising: a client device comprising: display control 
means for controlling the display of a speech input window comprising plural input forms 
(coL 9, lines 14-30 discuss a form-filling method. It is inherent that the client device 
would include software program controlling the display of forms)] determining means for 
determining from among the displayed plural input forms the input form to which speech 
information is input as a target speech input (col. 9, lines 14-30, a form includes multiple 
data fields with each data field being associated with a particular set of speech 
recognition grammars)] storing means for storing a user dictionary which holds target 
recognition words and input form identifying information in association with each other 
(col. 4, lines 27-44, speech recognition grammars transmitted to the server from client 
device)] speech receiving means for receiving speech information inputted by a speech 
input module (coL 3, lines 5-67)] first transmission means for transmitting input form 
identifying information indicating the input form to which speech information is 
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determined to be input by said determining means (col. 9, lines 14-30, the grammars 
associated with a particular form field), the user dictionary, and the speech information 
to the server (col. 4, line 27 to col. 6, line 67), and inputting means for inputting a 
speech recognition result received from the server to the input form to which speech 
information is determined to be input by said determining means (col. 9, lines 14-30)\ a 
server comprising: receiving means for receiving the input form identifying information, 
the user dictionary, and the speech information (server 100 in figure 1 and referring to 
col. 4, In. 27 to col. 6, In. 67); speech recognition means for recognizing the speech 
information using the target recognition words of the user dictionary associated with the 
input form to which speech is determined by said determining means to have been input 
that is identified by the received input from identifying information (col. 4, In. 27 to col. 7, 
In. 67); and second transmission means for transmitting the speech recognition result of 
said speech recognition means to the client (col. 7, In. 1 to col. 8, In. 67). 

4. Regarding claim 44, Dragosh et al. further disclose a system according to claim 
43, wherein said server further comprises holding means for holding a plurality of kinds 
of recognition dictionaries, and a table managing a correspondence of the input form 
identifying information and each of the plurality of kinds of recognition dictionaries, and 
wherein said speech recognition means selects a recognition dictionary corresponding 
to the received input form identifying information from said holding means by referring to 
the table, and recognizes the speech using the selected recognition dictionary and the 
selected target recognition word (col. 5, In. 1 to col. 6, In. 67). 
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5. Regarding claims 45, 51 , 57, and 60, Dragosh et al. disclose that in a client- 
server speech recognition system, method, and computer-readable memory, an 
information processing apparatus acting as the client comprising: display control means 
for controlling the display of a speech input window comprising plural input forms (col. 9, 
lines 14-30 discuss a form-filling method. It is inherent that the client device would 
include software program controlling the display of forms)] determining means for 
determining from among the displayed plural input forms the input form to which speech 
information is input as a target speech input (coL 9, lines 14-30, a form includes multiple 
data fields with each data field being associated with a particular set of speech 
recognition grammars)] storing means for storing a user dictionary which holds target 
recognition words and input form identifying information in association with each other 
(col. 4, lines 27-44, it is inherent that the client device has a memory for storing speech 
recognition grammars to be transmitted to the server to perform speech recognition)] 
speech receiving means for receiving speech inputted by a speech input module using 
an input form (col. 3, lines 5-67 and col. 9, lines 14-30 discuss a form-filling method)] 
transmission means for transmitting input form to which speech information is 
determined to be input by said determining means, the user dictionary which holds 
target recognition words and input form identifying information in association with each 
other (col. 9, lines 14-30, the grammars associated with a particular form field), and the 
speech information to the server (col. 4, line 27 to col. 6, line 67); and inputting means 
for inputting a speech recognition result received from the server to the input form to 
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which speech information is determined to be input by said determining means (co/. 9, 
lines 14-30). 

6. Regarding claims 46, 52, and 58, Dragosh et al. disclose that in a client-server 
speech recognition system, method, and computer-readable memory for recognizing, by 
a server, speech input at a client for inputting information to an input form, the client 
having a speech input window comprising plural input forms, an information processing 
apparatus acting as the server comprising: receiving means for receiving input form 
identifying information indicating the input from among the plural input forms in the 
speech window to which speech is inputted by the client, a user dictionary holding target 
recognition words and input form identifying information in association with each other 
{server 100 in figure 1 and referring to col. 4, line 27 to col. 6, line 67 and col. 9, lines 
14-30, the grammars associated with a particular form field)] speech recognition means 
for recognizing the speech information using the target recognition words of the user 
dictionary associated with the input form to which speech is inputted as identified by the 
received input form identifying information (col. 6, In. 1 to col. 7, In. 67); and 
transmission means for transmitting the speech recognition result of said speech 
recognition means to the client (col. 6, In. 1 to col. 7, In. 67). 

7. Regarding claim 47, Dragosh et al. disclose a client-server speech recognition 
system comprising: a client comprising: display control means for controlling the display 
of a speech input window comprising plural input forms (col. 9, lines 14-30 discuss a 
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form-filling method. It is inherent that the client device would include software program 
controlling the display of forms)] determining means for determining from among the 
displayed plural input forms the input form to which speech information is input as a 
target speech input (coL 9, lines 14-30, a form includes multiple data fields with each 
data field being associated with a particular set of speech recognition grammars)] 
storing means for storing a user dictionary which holds target recognition words and 
recognition dictionaries identifying information in association with each other (coL 4, 
lines 27-44, speech recognition grammars to be transmitted to the server to from the 
client device)] speech receiving means for receiving speech information inputted by a 
speech input module (coL 3, In. 5-67); and first transmission means for transmitting 
input form identifying information indicating the input form to which speech information 
is determined to be input by said determining means, the user dictionary, and the 
speech information to the server (coL 4, line 27 to col. 6, line 67 and col. 9, lines 14-30, 
a form includes multiple data fields with each data field being associated with a 
particular set of speech recognition grammars)] and inputting means for inputting a 
speech recognition result received from the server to the input form to which speech 
information is determined to be input by said determining means (col. 9, lines 14-30)] 
and a server comprising: holding means for holding a plurality of kinds of recognition 
dictionaries (Load Client Grammar 304 in figure 3); receiving means for receiving the 
input form identifying information, the user dictionary, and the speech information 
(server 100 in figure 1 and referring to col. 4, line 27 to col. 6, line 67); speech 
recognition means for selecting a recognition dictionary from said holding means 
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corresponding to the input form to which speech information is determined to be input 
by said determining means identified by the received input form identifying information 
(coL 9, lines 14-30, a plurality of form fields associated with individual sets of grammars 
are transmitted to the client server), selecting, from the received user dictionary, target 
recognition words associated with the recognition dictionary corresponding to the 
selected recognition dictionary identified by the recognition dictionaries identifying 
information (coL 4, line 27 to col. 7, line 67 and col. 9, lines 14-30, each said 
associated form-field-set-of-grammars is selected for used by a speech recognizer to 
recognize input speech for that particular form field)] and second transmission means 
for transmitting the speech recognition result of said speech recognition means to the 
client (coL 7, line 1 to col. 8, line 67). 

8. Regarding claims 48 and 54, Dragosh et al. disclose that in a client-server 
speech recognition system and method, an input information processing apparatus and 
method acting as the client comprising: display control means for controlling the display 
of a speech input window comprising plural input forms (coL 9, lines 14-30 discuss a 
form-filling method. It is inherent that the client device would include software program 
controlling the display of forms)] determining means for determining from among the 
displayed plural input forms the input form to which speech information is input as a 
target speech input (coL 9, lines 14-30, a form includes multiple data fields with each 
data field being associated with a particular set of speech recognition grammars)] 
storing means for storing a user dictionary which holds target recognition words and 
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recognition dictionaries identifying information in association with each other (coL 4, 
lines 27-44, speech recognition grammars to be transmitted to the server from client 
device)] speech receiving means for receiving speech information inputted by a speech 
input module using an input form (coL 3, lines 5-67); transmission means for 
transmitting input form to which speech information is determined to be input by said 
determining means, a user dictionary which holds target recognition words and 
recognition dictionaries identifying information in association with each other (coL 9, 
lines 14-30, each set of speech recognition grammars is associated with a particular 
form field)\ and inputting means for inputting a speech recognition result received from 
the server to the input form to which speech information is determined to be input by 
said determining means (coL 9, lines 14-30); and a server comprising: holding means 
for holding a plurality of kinds of recognition dictionaries {col. 9, lines 14-30, each form 
field is associated with a particular set of speech recognition grammars transmitted to 
the server); receiving means for receiving the input form identifying information, the user 
dictionary, and the speech information (coL 4, line 27 to col. 6, line 67); speech 
recognition means for selecting a recognition dictionary corresponding to the input form 
in the speech input window of the client device to which speech information is input 
identified by the received input form identifying information from a plurality of kinds of 
recognition dictionaries held by the server (coL 4, line 27 to col. 7, line 67 and col. 7, line 
1 to col. 8, line 67 and col. 9, lines 14-30, each form field is associated with a particular 
set of speech recognition grammars. The server selects the associated set of 
grammars to perform speech recognition for each form field), selecting, from the 
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received user dictionary, target recognition words associated with the recognition 
dictionary corresponding to the selected recognition dictionary identified by the 
recognition dictionaries identifying information (coL 9, lines 14-30, each form field is 
associated with a particular set of speech recognition grammars. The server selects the 
associated set of grammars to perform speech recognition for each form field), and 
recognizing the speech information using the selected recognition dictionary and the 
selected target recognition words (coL 9, lines 14-30, each form field is associated with 
a particular set of speech recognition grammars. The server selects the associated set 
of grammars to perform speech recognition for each form field); and transmission 
means for transmitting the speech recognition result of said speech recognition means 
to the client (col. 9, lines 14-30). 

9. Regarding claims 49, 55 and 61 , Dragosh et al. disclose that in a client-server 
speech recognition system and method for recognizing, by a server, speech input at a 
client for inputting information to an input form, the client having a speech input window 
comprising plural input forms, an information processing apparatus acting as the server 
comprising: holding means for holding a plurality of kinds of recognition dictionaries 
(Load Client Grammar 304 in figure 3); receiving means for receiving input form 
identifying information from among the plural input forms in the speech input window of 
the client to which speech information is input, a user dictionary, and the speech 
information from the client, the user dictionary holding a target recognition word and 
recognition dictionaries identifying information in association with each other (server 100 



Application/Control Number: 09/993,570 Page 10 

Art Unit: 2655 

in figure 1 or referring to col. 4, line 27 to col. 7, line 67 and col. 9, lines 14-30, each 
form field is associated with a particular set of speech recognition grammars)] a speech 
recognition step of selecting a recognition dictionary corresponding to the input form in 
the speech input window of the client device to which speech information is input 
identified by the recieived input form identifying information from a plurality of kinds of 
recognition dictionaries held by the server (col. 4, line 27 to col. 7, line 67 and col. 7, line 
1 to col. 8, line 67 and col. 9, lines 14-30, each form field is associated with a particular 
set of speech recognition grammars. The server selects the associated set of 
grammars to perform speech recognition for each form field), selecting, from the 
received user dictionary, target recognition words associated with the recognition 
dictionary corresponding to the selected recognition dictionary identified by the 
recognition dictionaries identifying information (col. 9, lines 14-30, each form field is 
associated with a particular set of speech recognition grammars. The server selects the 
associated set of grammars to perform speech recognition for each form field), and 
recognizing the speech information using the selected recognition dictionary and the 
selected target recognition words (col. 9, lines 14-30, each form field is associated with 
a particular set of speech recognition grammars. The server selects the associated set 
of grammars to perform speech recognition for each form field)] and transmission 
means for transmitting the speech recognition result of said speech recognition means 
to the client (col. 9, lines 14-30). 
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Conclusion 



Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Huyen X. Vo whose telephone number is 571-272-7631 . 
The examiner can normally be reached on M-F, 9-5:30. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Wayne Young can be reached on 571-272-7582. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 
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